智能论文笔记

Out-of-Distribution Detection for LiDAR-based 3D Object Detection

Chengjie Huang , Van Duong Nguyen , Vahdat Abdelzad , Christopher Gus Mannes , Luke Rowe , Benjamin Therien , Rick Salay , Krzysztof Czarnecki

分类：计算机视觉 | 人工智能 | 机器学习 | 机器人

2022-09-28

3D对象检测是自动驾驶的重要组成部分，深层神经网络（DNNS）已达到此任务的最新性能。但是，深层模型臭名昭著，因为将高置信度得分分配给分布（OOD）输入，即未从训练分布中得出的输入。检测OOD输入是具有挑战性的，对于模型的安全部署至关重要。已经针对分类任务进行了广泛研究OOD检测，但是它尚未对对象检测任务，特别是基于激光雷达的3D对象检测的注意力。在本文中，我们关注基于激光雷达的3D对象检测的OOD输入的检测。我们制定了OOD输入对于对象检测的含义，并提议适应几种OOD检测方法进行对象检测。我们通过提出的特征提取方法来实现这一目标。为了评估OOD检测方法，我们开发了一种简单但有效的技术，用于为给定的对象检测模型生成OOD对象。我们基于KITTI数据集的评估表明，不同的OOD检测方法具有检测特定OOD对象的偏差。它强调了联合OOD检测方法的重要性以及在这个方向上进行更多研究。

translated by 谷歌翻译

A Safety Assurable Human-Inspired Perception Architecture

Rick Salay , Krzysztof Czarnecki

分类：机器学习 | 人工智能 | 机器人

2022-05-10

尽管使用深神经网络（DNN）的基于人工智能的感知（AIP）接近人类水平的表现，但其众所周知的局限性是对自主应用所需的安全保证的障碍。这些包括对对抗性输入的脆弱性，无法处理新的输入和无解释性。在解决这些局限性方面的研究中，在本文中，我们认为需要一种根本不同的方法来解决它们。受到人类认知的双重过程模型的启发，其中1型思维是快速且无意识的，而2型思维则缓慢并且基于有意识的推理，我们为安全AIP提出了双重过程体系结构。我们回顾了有关人类如何解决最简单的非平凡感知问题，图像分类的研究，并为此任务绘制相应的AIP体系结构。我们认为，这种体系结构可以提供一种系统的方法来解决使用DNNS的AIP局限性，并可以保证人类水平的绩效及以后的方法。最后，我们讨论了现有工作可能已经解决了哪些架构的组成部分以及未来的工作。

translated by 谷歌翻译

The missing link: Developing a safety case for perception components in automated driving

Rick Salay , Krzysztof Czarnecki , Hiroshi Kuwajima , Hirotoshi Yasuoka , Toshihiro Nakae , Vahdat Abdelzad , Chengjie Huang , Maximilian Kahn , Van Duong Nguyen

分类：机器学习 | 人工智能 | 机器人

2021-08-30

安全保证是自动驾驶（AD）系统发展和社会接受（AD）系统的核心问题。感知是广告的关键方面，严重依赖机器学习（ML）。尽管基于ML的组件的安全性有已知的挑战，但最近已经提出针对解决这些组件的单位安全案例的建议。不幸的是，AD安全案例在系统级别上表示安全要求，这些努力缺少将安全性要求与单位级别的组件性能要求整合在一起所需的关键链接参数。在本文中，我们提出了感知的集成安全案例（ISCAP），这是针对专门针对感知组件量身定制的这种链接安全参数的通用模板。该模板采用演绎且形式上的方法来定义级别之间强大的可追溯性。我们通过详细的案例研究证明了ISCAP的适用性，并讨论了其作为支持感知成分增量发展的工具的使用。

translated by 谷歌翻译

EvidenceCap: Towards trustworthy medical image segmentation via evidential identity cap

Ke Zou , Xuedong Yuan , Xiaojing Shen , Yidi Chen , Meng Wang , Rick Siow Mong Goh , Yong Liu , Huazhu Fu

分类：计算机视觉

2023-01-01

Medical image segmentation (MIS) is essential for supporting disease diagnosis and treatment effect assessment. Despite considerable advances in artificial intelligence (AI) for MIS, clinicians remain skeptical of its utility, maintaining low confidence in such black box systems, with this problem being exacerbated by low generalization for out-of-distribution (OOD) data. To move towards effective clinical utilization, we propose a foundation model named EvidenceCap, which makes the box transparent in a quantifiable way by uncertainty estimation. EvidenceCap not only makes AI visible in regions of uncertainty and OOD data, but also enhances the reliability, robustness, and computational efficiency of MIS. Uncertainty is modeled explicitly through subjective logic theory to gather strong evidence from features. We show the effectiveness of EvidenceCap in three segmentation datasets and apply it to the clinic. Our work sheds light on clinical safe applications and explainable AI, and can contribute towards trustworthiness in the medical domain.

translated by 谷歌翻译

FedUKD: Federated UNet Model with Knowledge Distillation for Land Use Classification from Satellite and Street Views

Renuga Kanagavelu , Kinshuk Dua , Pratik Garai , Susan Elias , Neha Thomas , Simon Elias , Qingsong Wei , Goh Siow Mong Rick , Liu Yong

分类：计算机视觉 | 机器学习

2022-12-05

Federated Deep Learning frameworks can be used strategically to monitor Land Use locally and infer environmental impacts globally. Distributed data from across the world would be needed to build a global model for Land Use classification. The need for a Federated approach in this application domain would be to avoid transfer of data from distributed locations and save network bandwidth to reduce communication cost. We use a Federated UNet model for Semantic Segmentation of satellite and street view images. The novelty of the proposed architecture is the integration of Knowledge Distillation to reduce communication cost and response time. The accuracy obtained was above 95% and we also brought in a significant model compression to over 17 times and 62 times for street View and satellite images respectively. Our proposed framework has the potential to be a game-changer in real-time tracking of climate change across the planet.

translated by 谷歌翻译

Reliable Joint Segmentation of Retinal Edema Lesions in OCT Images

Meng Wang , Kai Yu , Chun-Mei Feng , Ke Zou , Yanyu Xu , Qingquan Meng , Rick Siow Mong Goh , Yong Liu , Xinxing Xu , Huazhu Fu

分类：计算机视觉

2022-12-01

Focusing on the complicated pathological features, such as blurred boundaries, severe scale differences between symptoms, background noise interference, etc., in the task of retinal edema lesions joint segmentation from OCT images and enabling the segmentation results more reliable. In this paper, we propose a novel reliable multi-scale wavelet-enhanced transformer network, which can provide accurate segmentation results with reliability assessment. Specifically, aiming at improving the model's ability to learn the complex pathological features of retinal edema lesions in OCT images, we develop a novel segmentation backbone that integrates a wavelet-enhanced feature extractor network and a multi-scale transformer module of our newly designed. Meanwhile, to make the segmentation results more reliable, a novel uncertainty segmentation head based on the subjective logical evidential theory is introduced to generate the final segmentation results with a corresponding overall uncertainty evaluation score map. We conduct comprehensive experiments on the public database of AI-Challenge 2018 for retinal edema lesions segmentation, and the results show that our proposed method achieves better segmentation accuracy with a high degree of reliability as compared to other state-of-the-art segmentation approaches. The code will be released on: https://github.com/LooKing9218/ReliableRESeg.

translated by 谷歌翻译

Desire Backpropagation: A Lightweight Training Algorithm for Multi-Layer Spiking Neural Networks based on Spike-Timing-Dependent Plasticity

Daniel Gerlinghoff , Tao Luo , Rick Siow Mong Goh , Weng-Fai Wong

分类：神经与进化计算 | 人工智能

2022-11-10

Spiking neural networks (SNN) are a viable alternative to conventional artificial neural networks when energy efficiency and computational complexity are of importance. A major advantage of SNNs is their binary information transfer through spike trains. The training of SNN has, however, been a challenge, since neuron models are non-differentiable and traditional gradient-based backpropagation algorithms cannot be applied directly. Furthermore, spike-timing-dependent plasticity (STDP), albeit being a spike-based learning rule, updates weights locally and does not optimize for the output error of the network. We present desire backpropagation, a method to derive the desired spike activity of neurons from the output error. The loss function can then be evaluated locally for every neuron. Incorporating the desire values into the STDP weight update leads to global error minimization and increasing classification accuracy. At the same time, the neuron dynamics and computational efficiency of STDP are maintained, making it a spike-based supervised learning rule. We trained three-layer networks to classify MNIST and Fashion-MNIST images and reached an accuracy of 98.41% and 87.56%, respectively. Furthermore, we show that desire backpropagation is computationally less complex than backpropagation in traditional neural networks.

translated by 谷歌翻译

Localizing Anatomical Landmarks in Ocular Images using Zoom-In Attentive Networks

Xiaofeng Lei , Shaohua Li , Xinxing Xu , Huazhu Fu , Yong Liu , Yih-Chung Tham , Yangqin Feng , Mingrui Tan , Yanyu Xu , Jocelyn Hui Lin Goh

分类：计算机视觉 | 机器学习

2022-09-25

Localizing anatomical landmarks are important tasks in medical image analysis. However, the landmarks to be localized often lack prominent visual features. Their locations are elusive and easily confused with the background, and thus precise localization highly depends on the context formed by their surrounding areas. In addition, the required precision is usually higher than segmentation and object detection tasks. Therefore, localization has its unique challenges different from segmentation or detection. In this paper, we propose a zoom-in attentive network (ZIAN) for anatomical landmark localization in ocular images. First, a coarse-to-fine, or "zoom-in" strategy is utilized to learn the contextualized features in different scales. Then, an attentive fusion module is adopted to aggregate multi-scale features, which consists of 1) a co-attention network with a multiple regions-of-interest (ROIs) scheme that learns complementary features from the multiple ROIs, 2) an attention-based fusion module which integrates the multi-ROIs features and non-ROI features. We evaluated ZIAN on two open challenge tasks, i.e., the fovea localization in fundus images and scleral spur localization in AS-OCT images. Experiments show that ZIAN achieves promising performances and outperforms state-of-the-art localization methods. The source code and trained models of ZIAN are available at https://github.com/leixiaofeng-astar/OMIA9-ZIAN.

translated by 谷歌翻译

Communication-Efficient {Federated} Learning Using Censored Heavy Ball Descent

Yicheng Chen , Rick S. Blum , Brian M. Sadler

分类：机器学习 | 人工智能

2022-09-24

分布式机器学习实现可扩展性和计算卸载，但需要大量的通信。因此，分布式学习设置中的沟通效率是一个重要的考虑因素，尤其是当通信是无线且采用电池驱动设备时。在本文中，我们开发了一种基于审查的重球（CHB）方法，用于在服务器工作者体系结构中分布式学习。除非其本地梯度与先前传播的梯度完全不同，否则每个工人的自我审查员。 HB学习问题的显着实际优势是众所周知的，但是尚未解决降低通信的问题。 CHB充分利用HB平滑来消除报告的微小变化，并证明达到了与经典HB方法相当的线性收敛速率，以平滑和强烈凸出目标函数。 CHB的收敛保证在理论上是合理的，对于凸和非凸案。此外，我们证明，在某些情况下，至少可以消除所有通信的一半，而不会对收敛率产生任何影响。广泛的数值结果验证了CHB在合成和真实数据集（凸，非凸和非不同情况）上的通信效率。鉴于目标准确性，与现有算法相比，CHB可以显着减少通信数量，从而实现相同的精度而不减慢优化过程。

translated by 谷歌翻译

Concatenated Classic and Neural (CCN) Codes: ConcatenatedAE

Onur Günlü , Rick Fritschek , Rafael F. Schaefer

分类：机器学习

2022-09-04

显示用于误差校正的小型神经网络（NNS）可改善经典通道代码并解决通道模型更改。我们通过多次使用相同的NN使用相同的NN扩展了任何此类结构的代码维度，这些NN与外部经典代码串行串联。我们设计具有相同网络参数的NN，其中每个REED - Solomon CodeWord符号都是对其他NN的输入。与小型神经代码相比，增加了加斯噪声通道的块误差概率的显着改善，以及通道模型变化的稳健性。

translated by 谷歌翻译